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We have analyzed the sequences of 77 nuclear genes of TV. crassa thought to be transcribed by RNA 
polymerase II (References 1-72) which should represent virtually all of the presently published nuclear 
gene sequences for this fungus. Kozak (1988, Nucl. Acids Res. 15:8125-) analyzed 699 vertebrate genes 
leading to identification of the vertebrate consensus sequence for initiation of translation, or Kozak 
Sequence: 

G44C39C53( A6 1 /G36)(C49/A27)C55 A 1 00T 1 00G 1 00G46 
We show here that the N. crassa Kozak sequence is: 
C57NNNC77 A8 1 ( A44/C43)"T"3 A99T 1 00G99G5 1 C53 

where the subscript number indicates the % occurrence of the particular nucleotide and "T" indicates the 
conserved absence of that particular nucleotide. 

We arbitrarily decided that a nucleotide was to be included in the consensus only if it was present in at least 
50% of all the sequences analyzed. If two nucleotides, each represented at less than 50%, gave a summed 
total of at least 75% representation for a single position, then both are shown in brackets. 

Table I. Consensus for initiation of translation and stop codons in Neurospora crassa 
No. Ref. Gene Distance from +1 Kozak Sequence Stop codon 



to ATG (bases) 



Consensus : CNNNCAATGGC 
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AATATCACAATGGCG 
CTGCCCATCATGGCT 
ATACGAGTTATGGCG 
TCACCAACCATGGCG 
CTTTTCACAATGGCT 
ACAGACAAAATGGCT 
CACGTCACCATGGCC 
TCCCTCACCATGACC 
ACCTTCAAAATGTCT 
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Key: - in the Distance from +1 to ATG (bases) means that the authors had not determined the +1 position 

The reason why the methionine start codon (ATG) is not 100% perfectly conserved within the Kozak 
consensus is that, for reasons unknown, the gene frq (Ref 33) starts its protein sequence with a valine 
(GTT). 

It is also interesting to note that the choice of the second codon appears to be limited in that about half of 
the second codons have a guanosine in the first position and another half have a cytosine in the second 
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position. 

On the whole, our consensus shows a good resemblance to the mammalian Kozak sequence with a similar 
hierarchy of nucleotide preference for a given position, although the degree of preference may be shifted. 
An exception is the nucleotide position immediately preceding the initiator methionine codon (ATG) where 
M. crassa exhibits a definite suppression of thymine in contrast to a positive preference for any other 
nucleotide. 

Fifty genes among the 77 analyzed have a determined mRNA 5' end. When several 5' ends were presented, 
+1 was taken to be the most distal from the ATG except when given by the authors themselves. In this way 
the mRNA sequences before the ATG have lengths between 30 and 622 bases. 

J The stop codon, determined by computer analysis by the authors, TAA in 62% of the cases, TGA in 27% 

and TAG in 11% 
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The Basics Important Elements For Translation 

There are some significant differences between prokaryotic and 
eukaryotic mRNA transcripts. Typically, eukaryotic mRNAs are 
characterized by two post-transcriptional modifications: a 5'-7 
methyl-GTP cap and a 3' poly(A) tail. Both modifications contribute 
to the stability of the mRNA by preventing degradation. 
Additionally, the 5' cap structure enhances the translation of mRNA 
by helping to bind the eukaryotic ribosome and assuring 
recognition of the proper AUG initiator codon. This function may 
vary with the translation system and with the specific mRNA being 
synthesized. The consensus sequence 5'-GCCACCAUGG-3', also 
known as the "Kozak" sequence, is considered to be the strongest 
ribosomal binding signal in eukaryotic mRNA. For efficient 
translation initiation, the key elements are the G residue at the +1 
position and the A residue at the -3 position. An mRNA that lacks 
the Kozak consensus sequence may be translated efficiently in 
eukaryotic cell-free systems if it possesses a moderately long 
S'-untranslated region (UTR) that lacks stable secondary structure. 

In bacteria, the ribosome is guided to the AUG initiation site by a 
purine-rich region called the Shine-Dalgarno (SD) sequence. This 
sequence is complementary to the 3' end of the 16s rRNA in the 
30S ribosomal subunit. Upstream from the initiation AUG codon, 
the SD region has the consensus sequence 5'-UAAGGAGGUGA~3\ 
Specific mRNAs vary considerably in the number of nucleotides that 
complement the anti-Shine-Dalgarno sequence of 16S rRNA, 
ranging from as few as two to nine or more. The position of the 
ribosome binding site (RBS) in relation to the AUG initiator is very 
important for efficiency of translation (usually from -6 to -10 
relative to the A of the initiation site). 

See " Ribosomal Binding Sites Sequence Requirements " for 

more information. 
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